Unsupervised Classification of Functions using Dirichlet Process Mixtures of Gaussian Processes
نویسندگان
چکیده
This technical report presents a novel algorithm for unsupervised clustering of functions. It proceeds by developing the theory of unsupervised classification in mixtures from the familiar mixture of Gaussian distributions, to the infinite mixture of Gaussian processes. At each stage a both a theoretical and an algorithmic exposition are presented. We consider unsupervised classification (or clustering) of functions where the functions are not limited to parametric forms and the number of function clusters is initially unknown. We approach this problem by considering an hierarchical model in which the functions are realisations from an infinite mixture of Gaussian processes, which is drawn from a Dirichlet process mixture. The primary difficulty in this approach is sampling the latent variables associating functions with their cognate Gaussian process, as this amounts to sampling from an infinite mixture. We show this may be achieved using the technique of retrospective sampling. Finally we present empirical data using both a synthetic problem and cell-cycle mRNA expression time-course data.
منابع مشابه
Learning Community-Based Preferences via Dirichlet Process Mixtures of Gaussian Processes
Bayesian approaches to preference learning using Gaussian Processes (GPs) are attractive due to their ability to explicitly model uncertainty in users’ latent utility functions; unfortunately existing techniques have cubic time complexity in the number of users, which renders this approach intractable for collaborative preference learning over a large user base. Exploiting the observation that ...
متن کاملHyper Markov Non-Parametric Processes for Mixture Modeling and Model Selection
Markov distributions describe multivariate data with conditional independence structures. Dawid and Lauritzen (1993) extended this idea to hyper Markov laws for prior distributions. A hyper Markov law is a distribution over Markov distributions whose marginals satisfy the same conditional independence constraints. These laws have been used for Gaussian mixtures (Escobar, 1994; Escobar and West,...
متن کاملUnsupervised classification and analysis of objects described by nonparametric probability distributions
Various objects can be summarily described by probability distributions: groups of raw data, paths of stochastic processes, neighborhoods of an image pixel and so on. Dealing with nonparametric distributions, we propose a method for classifying such objects by estimating a finite mixture of Dirichlet distributions when the observed distributions are assumed to be outcomes of a finite mixture of...
متن کاملTowards Large-scale Occupancy Map Building using Dirichlet and Gaussian Processes
This paper proposes a new method for building occupancy maps using Dirichlet and Gaussian processes. We consider occupancy map building as a classification problem and apply Gaussian processes. The main drawback of Gaussian processes, however, is the computational complexity of O(n) related to the matrix inversion, where n is the number of data points. To enable large-scale occupancy map buildi...
متن کاملMesh Segmentation Using Laplacian Eigenvectors and Gaussian Mixtures
In this paper a new completely unsupervised mesh segmentation algorithm is proposed, which is based on the PCA interpretation of the Laplacian eigenvectors of the mesh and on parametric clustering using Gaussian mixtures. We analyse the geometric properties of these vectors and we devise a practical method that combines single-vector analysis with multiple-vector analysis. We attempt to charact...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006